Rank in Wordlist | Frequency | Word |
---|---|---|
5171 | 101 | %, |
9202 | 52 | е,че |
11232 | 40 | 1,5 |
11716 | 38 | 2,5 |
12098 | 37 | народ,който |
12712 | 35 | пищялки,само |
12884 | 34 | глупав,то |
15146 | 28 | се,че |
15178 | 28 | това,че |
17191 | 23 | 1,2 |
Rank in Wordlist | Frequency | Word |
---|---|---|
116468 | 1 | %( |
218169 | 1 | „( |
Rank in Wordlist | Frequency | Word |
---|---|---|
19794 | 19 | %) |
84510 | 2 | $) |
116366 | 1 | $$$$$) |
Rank in Wordlist | Frequency | Word |
---|---|---|
1330 | 388 | 10% |
1664 | 315 | 20% |
1801 | 294 | 100% |
1819 | 291 | 50% |
1964 | 272 | 30% |
2412 | 221 | 90% |
2492 | 214 | 40% |
2517 | 212 | 5% |
3328 | 161 | 2% |
3481 | 154 | 3% |
Rank in Wordlist | Frequency | Word |
---|---|---|
6407 | 79 | S&P |
25965 | 13 | AT&T |
29265 | 11 | H&M |
51406 | 5 | Принс&Дана |
51536 | 5 | С&П |
57678 | 4 | R&B |
58778 | 4 | КТ&G |
68879 | 3 | L&M |
69009 | 3 | Standard&Poor's |
86536 | 2 | Booz&Co |
Rank in Wordlist | Frequency | Word |
---|---|---|
33471 | 9 | $2 |
36195 | 8 | $1 |
39572 | 7 | $. |
39573 | 7 | $100 |
39574 | 7 | $5000 |
43851 | 6 | $, |
43852 | 6 | $200 |
43853 | 6 | $2000 |
49340 | 5 | $150 |
49341 | 5 | $40 |
Rank in Wordlist | Frequency | Word |
---|---|---|
84531 | 2 | %" |
116469 | 1 | %," |
Rank in Wordlist | Frequency | Word |
---|---|---|
12065 | 37 | к'во |
12229 | 36 | Moody's |
14794 | 28 | Poor's |
17708 | 23 | т'ва |
20677 | 18 | Кот д'Ивоар |
20879 | 18 | д'Ивоар |
21940 | 17 | мат'ряла |
28284 | 12 | к'вото |
29419 | 11 | Ето'о |
30261 | 11 | мат'рял |
Rank in Wordlist | Frequency | Word |
---|---|---|
7405 | 67 | и/или |
8578 | 57 | км/ч |
9012 | 53 | 1/3 |
11959 | 37 | 2/3 |
13654 | 32 | с/у |
18094 | 22 | км/ч. |
18450 | 21 | БКП/ДС |
19435 | 20 | м/у |
20208 | 19 | л/100 |
22430 | 16 | 3/4 |
In the last subsection of this type we look for words containing other special characters: , ( ) % & $
" ' + * = / _
Depending on the language some of these characters may be allowed within words, other will not. If words with forbidden characters do not have very low frequency there might be a problem in preprocessing.
Words containing %:
select w_id-100,freq, word from words where w_id>100 and word like "%\%%" limit 10;
3.12.1 Words with Hyphens
3.12.2 Multiwords
3.12.3 (Multi-)Words with dots